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Like recursive-descent but parser can "predict" which 
production to use 

• By looking at the next few tokens 

• No backtracking 

Predictive parsers accept LL(k) grammars 

L means "left-to-right" scan of input 

L means "leftmost derivation" 

• k means "predict based on k tokens of lookahead" 
In practice, LL(i) is used 
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• In recursive-descent, 

• At each step, many choices of production to use 

• Backtracking used to undo bad choices 

• In LL(i), 

• At each step, only one choice of production 

• That is 

• When a non-terminal A is leftmost in a derivation 

• The next input symbol is t 

• There is a unique production A a to use 

Or no production to use (an error state) 

LL(i) is a recursive descent variant without backtracking 
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Predictive 
Parsinq and 

Left 

Factoring 



Recall the grammar 




Hard to predict because 

• For two productions start with int 

For E it is not clear how to predict 



It is not easy to predict which production to use based on 
only a single token 

• This grammar is not LL(i) 

We need to left-factor the grammar (Rewrite the grammar) 
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Recall the grammar 

E -»T + E IT 



T 




int 



int *T I ( E ) 



Factor out common prefixes of productions into a 
single production 



Introduce a new non-terminal for the suffixes of 
the productions that are left factored and write 
them as alternatives. 



Then, it will be multiple productions, one for each 
suffix. 



E^TX 

X -> + E \ s 
T ->( E ) I int y 

y^*T I s 
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LL(i) Parsing 

Table 

Example 



Left-factored grammar 



E 

T 





TX 

(E)lintY 



X 




Y 




+ E 
*T 



8 



8 




The LL(i) parsing table: 



next input token 





int 


* 


+ 


( 


) 


$ 


E 














X 














T 














y 















leftmost non-terminal 



rhs of production to use 
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LL(i) Parsing 

Table 

Example 

(Cont.) 



Considerthe [E, int] entry 

• "When current non-terminal is E and next input is 
int, use production E — » TX" 

• This can generate an int in the first position 

• Considerthe [Y,+] entry 

"When current non-terminal is Y and current token 
is +, get rid of Y" 

• Y can be followed by + only if Y — » c 
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Blank entries indicate error situations 




Considerthe E,^] entry 

• "There is no way to derive a string starting with * 
from non-terminal E" 
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Method similar to recursive descent, except 

• For the leftmost non-terminal S 

• We look at the next input token a 

• And choose the production shown at [S,a] 

A stack records frontier of parse tree 

• Non-terminals that have yet to be expanded 

• Terminals that have yet to matched against the input 

• Top of stack = leftmost pending terminal or non- 
terminal 

Reject on reaching error state 
Accept on end of input & empty stack 
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initialize stack ; 
repeat 

case stack of 
<X, rest> 




<t, rest> 
until stack == 



<S $> and next 



if T[X,*next] = Y^.Yn 
then stack <Y 1 ...Y n 

else error (); 
if t == *next ++ 
then stack <rest>; 
else error (); 
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Stack 




E $ 

TX $ 
intYX $ 
YX $ 
*TX $ 
TX $ 
intYX $ 
YX $ 
X$ 

$ 



Act on 



Input 
int * int $ 
int * int $ 
int * int $ 

* int $ 

* int $ 
int $ 
int $ 

$ 

$ 

$ 



TX 

intY 

terminal 

*T 

terminal 

intY 

terminal 

s 

8 

ACCEPT 
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Quiz 



Choose the next parse state given the grammar, parse table, and 
current state below. The initial string is: 

if true then { true } else { if false then { false } } $ 





if 


then 


else 


{ 


} 


true 


false 


~r | 


1 E 


if B then { E } E # 








£ 


B 


B 


8 1 


E 






else { E } 




8 






8 1 


1 B 












true 


false 





Current 

o 

o 

0 

0 



Stack 



E'$ 




else { E } $ 



Input 



else {if false then { false }} $ 




E 




if B then { E } E' 



B 



8 





else { E } 



8 



B 




true 



false 



E}$ 



else {if B then { E } E' } $ 



else { if false then { false }} $ 

if false then {false }}$ 
else { if false then { false } } $ 
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Definition 





Algorithm sketch: 

1. First(t) = { t } 

2. 86 First(X) 

• ifX^c 

• \fX^A 1 ... A n and c e First(Aj) for l < i < n 

3. First(a) c: First(X) if X^ A 1 ... A n a and c e First(Aj) 

for 1 < i < n 
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Recall the grammar 

E^TX X^ + E|c 

T — > ( E ) | intY Y^*T|e 



First Sets. 
Example 



First sets 
First( () = {(} 
First( )) = {)} 
First( int) = { int } 
First( + ) = { + } 
First( *) = {*} 



First( ) = {int, ( } 
First( E ) = {int, (} 
First(X ) = {+,£} 
First(Y ) = {*,£} 
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Computing 

Follow Sets 



Definition: 

Follow(X) = { 1 1 S P X 1 5 } 

Intuition 

• If X — » A B then First(B) c Follow(A) and 

Follow(X) cz Follow(B) 

• if B —>* c then Follow(X) c= Follow(A) 

If S is the start symbol then $ e Follow(S) 



Dr. Sherin ElGokhy 




Algorithm sketch: 

1. $ e Follow(S) 

2. First(P) - {c} c= Follow(X) 

For each production A 

3. Follow(A) c Follow(X) 

For each production A 




a X p where c e First(P) 
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Follow Sets. 
Example 



Recall the grammar 

E^TX 

T— » ( E ) | intY 

Follow sets 
Follow( + ) = { int, ( } 
Follow( ( ) = { int, ( } 
Follow(X) = {$,)} 

FollowO ) = {+,),$} 

Follow( int) = {*, +, ) 



X^ + E s 
Y — » *T e 



Follow( * ) = { int, ( } 
Follow(E) = {), $} 
Follow(T) = {+,),$} 
Follow(Y ) = {+,),$} 

$} 
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Consider non-terminal A, production A a, & token t 
T[A,t] = a in two cases: 



If a — »* t [3 

• a can derive a t in the first position 

• We say that t e First(a) 



If A 




a and a 




* 



c and 




* 



P A 1 5 



Useful if stack has A, input is t, and 
In this case only option is to get rid 

• Can work only if t can follow A 
derivation 




cannot derive 



of A (by deriving c) 
in at least one 



• We say t e Follow(A) 
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Construct a parsing tableT for CFG G 

For each production A — » a in G do: 

• For each terminal t e First(a) do 

T[A, t] = a 

• If c e First(a), for each t e Follow(A) do 

T[A, t] = a 

• If c e First(a) and $ e Follow(A) do 

T[A, $] = a 
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Notes on 

LL(i) 

Parsing 

Tables 



If any entry is multiply defined then G is not LL(i) 

This means: 

• If G is ambiguous 

• If G is left recursive 

• If G is not left-factored 

• And in other cases as well 

Most programming language CFGs are not LL(i) 
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